Integration of Maximum Information Using Outerjoins, Predicates and Foreign Functions

نویسنده

  • Koichi MUNAKATA
چکیده

The goal of this paper is to present algorithms for creating an optimized query plan for retrieving maximum information from multiple relations, using outerjoins. Especially we focus on conjunctive queries in the presence of predicates and foreign functions. We show first with examples that retrieving maximum information by integrating multiple relations requires outerjoin operators. The outerjoin is essential to prevent information loss that would be caused by the inner join. We also show that predicates and foreign functions are useful to mediate the discrepancy among the relations and to create arbitrary views. Outerjoins and foreign functions, together with predicates, make it difficult to create query processing plans since they impose restrictions on the order of query processing. The rest of this paper describes algorithms for creating such query processing plans for conjunctive queries expressed in extended Datalog. First, we show simple algorithms for creating query plans with outerjoins, but without predicates and foreign functions. We use the hypergraph representation of the relations to explain an optimized algorithm. Then, we show a more complex algorithm that works for query plans with predicates and foreign functions. In our algorithm, we create an initial expression graph whose nodes represent query processing units, including outerjoin, predicate and foreign function operators. Then, we convert the initial expression graph into an executable, optimized expression tree. This algorithm is implemented and deployed in a mediation system that integrates heterogeneous information sources. key words: database, query processing, outerjoin, integration

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Increasing the Effectiveness of Russian Language Teaching for Special Purposes (to the Problem of Integration of Language Training with Information Technology Courses)

The article is devoted to the problem of increasing the efficiency of language teaching for the special purposes of foreign students in studying Russian at a technical university. Particular attention is paid to the training of foreign students in the skills of working with information using the latest computer technology. The conclusions of the work are based on the analysis of the results of ...

متن کامل

Comparing Very Large Database Snapshots

Detecting and extracting modi cations from information sources is an integral part of data warehousing. For unsophisticated sources, in practice it is often necessary to infer modi cations by periodically comparing snapshots of data from the source. We call this problem the snapshot di erential problem. We show that this is closely related to outerjoins. In this paper we extend the traditional ...

متن کامل

The Interactive Relationship between Regional Trade Integration and Foreign Direct Investment

E conomic integration among countries has continued to deepen over the past decade. This is especially visible at the regional level, with the escalation of Regional Integration Agreements (RIAs) ranging from Free Trade Areas (FTAs) to Customs Unions (CUs). Nowadays, many developing countries have entered a new regional integration agreement with developed and developing countries. S...

متن کامل

Integrating Information by Outerjoins and Full Disjunctions 1 I. Motivation

Our motivation is the piecing together tidbits of information found on the \web" into a usable information structure. The problem is related to that of computing the natural outerjoin of many relations in a way that preserves all possible connections among facts. Such a computation has been termed a \full disjunction" by Galindo-Legaria. We are thus led to ask the question of when a full disjun...

متن کامل

Partial Natural Outerjoin - An Operation for Interoperability in a Multidatabase Environment

Natural outerjoin has been considered as an important operation in schema integration. It can be used to define various views in cooperation with other operations. Due to the existence of inconsistent data and null values in base relations of multiple databases, the traditional natural outerjoin cannot be directly applied to schema integration in a multidatabase environment. In this paper, the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999